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Abstract. In this paper, we give a general framework for the Boltzmann gen- 
eration of colored objects belonging to combinatorial constructible classes. We 
propose an intuitive notion called profiled objects which allows the sampling of 
size-colored objects (and also of fc-colored objects) although the corresponding 
class cannot be described by an analytic ordinary generating function. 

Efficient generation of extremely large objects is needed in many situations. For 
instance in statistical physics for observing limit behaviors [5] , in biology for under- 
standing and analysing genome properties |12j . or in computer sciences for testing 
programs [3] or simulating and modelizing Network as Internet [7J [3]. In 2003, 
Duchon, Flajolet, Louchard and Schaeffer [5] proposed a new model, called Boltz- 
mann model, which leads to systematically construct samplers for random genera- 
tion of objects in combinatorial constructible classes. These samplers depends on a 
real parameter x and generate an object a in a combinatorial constructible class A 
with a probability essentially proportional to x' a where a| is the size of a. Hence 
they draw uniformely in the class A n of all the objects of size n in A. The size of the 
output is a random variable, and parameter x can be tuned for a targetted mean 
value. Moreover using rejection, one can obtain exact size samplers or approximate 
size samplers. This new approach differs from the "recursive method" introduced 
by Nijenhuis and Wilf [TT] by giving the possibility of relaxing the constraint of 
an exact size for the output and this implies a significant gain in complexity: no 
preprocessing phase is needed and expected time complexity is linear in the size of 
the output. 

Boltzmann model has been described both in the cases of unlabelled combinato- 
rial constructible (also called specifiable or decomposable) classes [3] and labelled 
combinatorial constructible classes [HE], for the most classical constructions (+, 
x, Seq, Set, Cyc,...). In this paper, we are interested in the generation under Boltz- 
mann model of size-colored combinatorial objects. Let a be a combinatorial object 
with n atoms, we say that this object is size-colored or colored if each of its atoms 
can be colored with a color in {1, n}. Our main motivation for this study stems 
from the following situation : Consider that the construction of an object of size n 
is distributed to n heterogenous processors. Each processors signs the atoms that 
it has build. A size-colored object is exactly a possible way to build such an object 
and we would like to highlight some properties about it. In order to do that, we 
are going to deal with well-known fc-colored combinatorial object (in this case, each 
atom can be colored with a color in {1, fc}, and fc does not depend on the size), 
fc-colored classes can be found in numerous problems as the fc-colored necklaces |10j , 
expression trees with fc types of n-ary functions, fc-colored planar trees, fc-colored 
Motzkin paths [13],... 



l 



2 



O. BODINI, A. JACQUOT 



For instance, all the 2-colored non-planar general trees of size 3 are : 

111122221 1 1 1 2 2 2 2 

i i i i i i i i / \ / \ i / \ . / \ / \ / \ , / \ . / \ 

1 1 2 2 1 1 2 2 11 12 (= 2l)22 11 12 (=2l)22 

1 1 1 1 1 1 1 

12121212 

Each object has to be drawn with the same probability jg. This example points 
out that symmetries increase the difficulty to build an uniform sampler. That can be 
also observed by comparing Boltzmann sampler codes for labelled and unlabellcd 
structures. In a naive approach, we could deal with in the classical Boltzmann 
model by duplicating k times atoms, but we give here a more general and efficient 
point of view. In particular, we obtain a relatively compressed storage that we 
explain in a later section and we can postpone the draw of the colors to avoid the 
unnecessary coloration during the rejection phase. 

A profiled object is by definition an object associated with a set-partition of its 
atoms. This concept is derived from Polya theory [T]. We prove that this structure 
is lifted to all /c-colorations. Roughly speeking, the atoms belonging the same 
partition-class could be interpreted as having the same colors. More precisely, for 
every k, a sampler for profiled objects can be specialised (by a good choice of the 
parameters) to become a sampler for any fc-colored objects, so that, in a sense, our 
model unifies all fc-colorations. Espacially, profiled object samplers can be adapted 
to efficient sampling of colored object in approximative size. Profiles enable us to 
generate colored objects with Boltzmann methods even if the ordinary generating 
function is not analytic in (which is an obstruction in classical Boltzmann theory) . 
For instance, let A be the unlabelled constructible class Seq(iJ), it is well known that 
the ordinary generating function is 1/(1 — z). In this case, the ordinary generating 
function for the class A of colored objects is n n z n which is clearly not analytic 

fc>0 

in 0. At first sight, it seems impossible to have a Boltzmann-type sampler for this 
class, since the parameter x has to be in the disk of convergence of the generating 
function. 

This paper is organised in four sections. The first section is devoted to the defini- 
tions of various taggings for combinatorial classes. We define a general framework, 
in terms of language, that allows to describe unlabelled and labelled classes as well 
as less studied classes such as semilabelled or colored classes. We also recall the 
notion of constructible classes which is central for Boltzmann sampler theory. The 
second section adresses the notion of profiled objects. We define the generating 
function associated to a profiled combinatorial class and express its relation with 
the generating functions of the associated /c-colored classes. 

In the third section, we construct Boltzmann samplers for the profiled combina- 
torial constructible classes and we show how to use it to obtain samplers for the 
/c-colored combinatorial constructible classes. 

In the last section, we propose an approximate size sampler for the colored 
combinatorial constructible classes. This sampler is based on a filter that allows to 
transform a sampler for n-colored objects into a sampler for colored objects. 

1. Combinatorial Classes 

Combinatorial classes are very well studied and classical objects [5]. We propose 
in this section a general framework, in terms of language, that allows to describe 
extended tagged combinatorial classes. Our approach can be seen as an introduction 
to a very simplified species theory pQ. Let 2t be the alphabet on the 7 following 
letters □{}[](). The square is called the atom. The other letters correspond to three 
different types of parentheses. 
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We denote by L the language on 21 defined as follows : 

• □ ,{},(),[] belong to £ 

• Vfc e N*, A\, A k e L => (A 1 A 2 ...A k ) e L, the sequences 

• Vfc e N*, Ai, Ah € L => [Ai^.-.Afe] e £, the cydic sequences 

• Vfc e N*, Ai, A fe e £ =>■ {Ai^.-.Afc} e £, the multisets 
The elements of £ are called the combinatorial proto-objects. 

The size of a proto-object A is the number of atoms contained in A. That is 
to say, the number of occurence of □ in A. For instance, ({[□□](□□)}□) is a 
combinatorial proto-object of size 5. 

Let A be a proto-object, we denote atom(A) the set of the occurences of □ in A. 
In particular, \atom(A)\ is the size of A and each element of atom(A) corresponds 
exactly to one occurence of □ in A. 

At this stage, we just have a support but we need to explain the meaning of the 
cycles and the multisets by some equivalences between proto-objects. In order to 
do that, let B be a set, a B-colored proto-object is a pair (A, f) where A is a word 
of L and / is a function from atom(A) to B. Let a be an atom of A, f(a) is called 
the color of a. We denote by Lb the set of all i?-colored proto-objects. 

We define the equivalence =b between £>-colorcd proto-objects of Lb as follows : 

• Two -B-colored squares are equivalent if and only if they are the same color : 

(□,/) = B {n,f)& /(□) = /'(□) 

• Two sequences are equivalent if their corresponding substructures are equiv- 
alent : 

Wli, A k ,B u ...,B k e L, ({A!A 2 ...A k ), f) = B ((B 1 B 2 ...B k ), /') 

Vi,f(Atom(Ai)) = f'(Atom(Bij) and {A,, f\ Mom(Ai) ) = B {B l J'\ A t om (B t )) 

• Two cycles are equivalent if their substructures are equivalent up to a cir- 
cular shift : 

VA , A k _ u B , Bfc_i G L, {[A A x ...A k _ x ], f) = B <[B --Bfc-i], /'} 

<=> 

3nV«, f(Atom(Ai)) = f'(Atom(B i+n mod fe )) 
and 

(Ai, f\Atom{Ai)) =B (Bi+n mod fe) / | Atom(B i+n mod k )} 

• Two sets are equivalent if their substructures are equivalent up to permu- 
tations (we denote by S k the symmetric group of order fc) : 

V^o,...,A fc _i,Bb,-,-Bfc-i G £,<{4>-4t-i},/> = B ({B ...Bfc-i},/'> 

<^> 

3a 6 S fc) V^,/(AtoTO(A ^ )) = /'(Atom(B CT(l) )) 
and 

(Ai, f\Atom(Ai)) =B (B a (i), f'\Atom(B„ (i) )) 

The set Ob of B-colored combinatorial objects is £b/ =b- Now, we can define 
the fundamental notion of combinatorial classes as follows : 

Definition 1.1. A B-colored combinatorial class is a sub-multiset of Lb I =b with 
a finite number of object of each size. 
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We opt in this definition for a multiset version which avoid the traditiomial 
problem of the disjoint union. The union A U B of two multisets is the multiset 
obtained with all the existing elements and where the multiplicity of a in A U B is 
the sum of the multplicity a in A and the multiplicity of a in B. 

The size of a i?-colored combinatorial object A is the size of any proto-object 
which represents it. 

Every object of size is called neutral object. For instance, {()()[]} is a neutral 
object. Clearly, Ob contains an infinite number of neutral objects, we carefully 
avoid the confusion between the empty combinatorial class and a combinatorial 
class reduced to a neutral element e. 

For example, the following {1, 2}-colored combinatorial class C contains 3 objects. 
The both objects of size 3 are identical. 

Now, we can add some restrictions on / to obtain different types of tagged com- 
binatorial objects. Some of them are classical (unlabelled, labelled), the other ones 
are more unusual but reasonable. 



Function 


Codomain 


Labelling 


one-to-one map 


{1, n} for an object of size n 


labelled 


constant map 


for instance {1} 


unlabelled 


quasi-constant map 


{0, 1} (3!yVx- + yf(x) = 1 and f(y) = 0) 


pointed 


surjective map 


{1, k} with k < n 


semi-labelled 


arbitrary map 


{1, ...,n} for an object of size n 


colored 


arbitrary map 


{1, k} with k fixed 


fc-colored 



FIGURE 1. Different labellings. 



For each of these tagged objects, we can easily define the notion of combinatorial 
classes. 

We essentially study in this paper the notion of colored and fc-colored classes. 
The semilabelled classes are the topic of another work. 

Let A be a combinatorial (unlabelled) class, we denote by A n the set of objects 
of size n. 

1.1. Constructible combinatorial classes. A constructible combinatorial class 
A(Z) is a class which can be build from neutral classes and an atomic class Z 
by using some of these builders (+,x,Seq,MSet,PSet,Cyc,...). (the builders Seq, 
M Set, PSet, Cyc cannot be applied to classes having neutral objects.) For instance, 
MSet{Z x Seq(Z)). We also accept the recursive construction : T = /(T). 
These constructions are exactly the ones that we recurrently find in modern theories 
of combinatorial analysis. We do not develop here the precise grammar of building. 
This can be found for instance in [5]. 

A colored constructible combinatorial class (resp. k- colored constructible combi- 
natorial class) is build from a constructible combinatorial class A{Z) as follows : 

oo 

A= |J A n {Z 1 + . . . + Z n ) (resp. A [k] = A{Zi + . . . + Z k )). The unique atom of the 

n=0 

atomic class Z k corresponds to an atom colored with the color k. Strictly speaking, 
A and A^ are not combinatorial tagged classes as defined in the previous section. 
But they can be obviously identify by expressing the colors in terms of function on 
the atoms. 
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In the sequel, we denote identically the colored constructible combinatorial class 
and its associated unlabelled combinatorial class. For instance, we could speak 
about the colored constructible combinatorial class MSet(Z x Seq(Z)) to say that 
we consider the colored constructible combinatorial class build from the unlabelled 
constructible class MSet(Z x Seq(Z)). 

2. Profiled combinatorial classes 

Let A be a set, a subset S of V{A) (the set of all parts of A) is called a set- 
partition of A if and only if the elements of S are disjoint, S does not contain 
unless that A = and the union of elements of S are equal to A. 

A profiled combinatorial object (a, a a ) is a combinatorial object a equiped with 
a set-partition a a of atom(a) which represents its profile or symmetry. 

Remark : In the classical Polya theory pQ, we deal with an automorphism s of 
atom(a) but this is overdimensioned according to what we need. We only want to 
mark the undistinguished atoms, a partition is clearly sufficient. 

Definition 2.1. A profiled combinatorial class is a multiset of profiled combinato- 
rial objects with a finite number of object of each size. 

Now, we can define the product and the diagonal of profiled combinatorial objects 
as follows : 

Product (concatenation). We define the symmetry a m of m = b x c as ObUo c . 
In other words, (b x c, uj U er c ). 

Diagonal. We define Ska — a. ..a where a is repeated k times. 

The symmetry of Ska is defined as follows : Let {Pi, P m } be the symme- 
try of a then the symmetry of Ska is {(J P\ , [J P^} where P[ corresponds to 

i i 

Pi but on the atoms of the z-th occurence of a. For instance, let {a, o~ a ) = 
(({[□□](□□)}□), {{1,3}, {2, 5}, {4}}) (the first □ is linked to the third □...), then 
5 2 {a, a a ) is (aa, {{1, 3} U {6, 8}, {2, 5} U {7, 10}, {4} U {9}}) 

A coloration of a profiled object of size n and symmetry a is consistant if on each 
subset of a every atom has the same color. We also say that a profile is c-admissible 
for a coloration c if each part of a is monocolor. 

2.1. Constructible profiled combinatorial classes, the constructible profiled 
combinatorial classes are exactly the classes that we can buld with the following 
list of operators : 

• Atomic class : Z = {□}, the set-partition is reduce to the singleton. 

• Union : A U B is the multiset-union of the classes. 

• Product : A x B = {(a x b, a a U en,}; a G A, b e B} 

oo 

• Sequence : Seq(A) = [J A % 

• Diagonal : Afc(_4) = {5ka;a G .4} 

• MultiSet : 

MSet(A) — {{B};B a concatenation of diagonals of elements in .4} 

• Cycle : 

Cyc(A) = { [B ] ; B a diagonal of concatened elements of ^4} 
A classical tool associated to combinatorial classes is the notion of generating 
functions. We explain below the building rules for the generating multivariate 
functions on the infinite many variables s±, S2, ■■■ associated to constructible profiled 
combinatorial classes. The variable are going to express a set of i linked atoms. 
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For example, this profiled general non-planar tree of size 14 : 



□ 




□ □ 



has Sj.S2.S6 as associated monomial. 

In fact, the multivariate generating function associated to constructible profiled 
combinatorial classes can be defined inductively as follows : 

• Neutral class : Fs = 1 

• Atomic class : Fz = si 

• Sum : F A+B = F A + F B 

• Product : F Ax b = F A x Fb 

• Sequence : F Seq(A) = ^ —y- 

• Diagonal: F Ak (a)( s i> s 2, ■■■) = F A (s h ,s 2k , ...) 

• MultiSet : F MSet{A) = ex^>Y.k>\ I F A k (A) 

. Cycle : F Cyc[A] = £^ In(l - F Ak{A) ) 

Let us notice that F A is not an enumerative generating function for the pro- 
filed objects in A. But it has the following very interesting property. Let us 
put f A (x,t) = F A (t.x,t.x 2 , ...,t.x k , ...). It is a classical result of Polya theory that 
[x n ]f A (x, t) — \A n {2i + . . . + Z t )\. The subtitution of Sj by t.x 1 only says that the 
set of i linked atoms represented by Sj can exactly be colored in t different ways 
and it must be considered as an object of size x l . So, the function f A (x,t) is the 
ordinary generating function for the f-colored combinatorial class A^ . In a sense, 
the generating function of F A contains the informations of all the generating func- 
tions for the t-colored combinatorial classes. In fact, this multivariate generating 
function is generally called cycle index sum and it is also fundamental in 0]. For 
instance, consider S = Mset(Z), the first terms of the profiled generating function 
are F MSet[z) = 1 + s x + sf/2 + s 2 /2 + sf/6 + sis 2 /2 + s 3 /3 + .... To obtain the 
enumerative generating function for the 3-colored Mset(Z), it suffices to replace Si 
by 3a; 1 . So, we find F MSet{z) = 1 + 3x + 6x 2 + I0x 3 + ... 

Remark : In constructive profiled classes, every symetries are only obtained 
by the diagonal operator. So, in the implementation, we do not need to stock 
the partition, we simply keep the diagonals unexpanded. That's why our profiled 
samplers return only an "object" a, instead of a couple (a, a) as in our proofs. 
Moreover, this choice induces a storage gain, as big as the object has symmetries. 



3. Probability and Boltzmann Samplers 

The principle of Boltzmann samplers can be describe as follows : Let A be a 
combinatorial unlabelled class and A{x) its ordinary generating function. Consider 
a non negative real number x in the convergence disk of A(x), a Boltzmann sampler 
TA is a random generator that draws each object a E A with probability P(a) = 
-j^pj- There are simple rules, described in [5] to build automatically Boltzmann 
samplers from the combinatorial specification of A. 
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For our purpose, we want to draw t-colorcd objects coming from a specified class 

x \a\ 

A(Z) with the following Boltzmann probability P t x (a) — -. 

We produce samplers for (non colored) profiled objects with the following Boltz- 
mann distribution : the probability to obtain a profiled object (a, a) such that a 

has rii parts of size i is —— . We then prove that ai- 

ter a consistant coloration and a good choice of the parameters, we also obtain a 
Boltzmann sampler for the i-colored objects. The notion of powersets is not very 
relevant on profiled combinatorial classes. Indeed, in a profiled object, some atoms 
are undistinguished, but there is no information on distinguished atoms, they can 
be the similar or not. Below, a tabular with the classical distributions of probability 
and the design rules for basic sampler constructions where TA designs a Boltzmann 
sampler for the class A. 



Distribution Notation Definition 



Bernoulli Bern(p) P(0) 

Geometric Geom(A) P(fc) 

Poisson Pois(A) P(fc) 

Positive Poisson Pois> (A) P(fc) 



= 1 - p and P(l) = p (with < p < 1) 
= A fc (l - A) (with k G N and < A < 1) 

\ k 

= e~ x — (with k e N and A G R+*) 

k 

= — !— 4t (with k e W and A G R+*) 
e A — 1 k\ 



Figure 2. Distributions of use in Boltzmann sampling. 

In the sequel, the symbol <— designs the affection. 

• Neutral class : r£( Sl S2! ) := return () 

• Atomic class : I\Z( Sl)S2) _) := return □ 
. Sum : r(^ + S) (si;S2i .'..)":= 

Draw X following a Bernoulli law of parameter Bern( jjdifilfHmj,^ 

Fa+b(si, s 2 , ■■■) 

If X — 1 return TA( Sl jS2 else return rS( SliS2i ...) 

• Product : T(A x B)( Sl)S2 ,...) := 

Let A <- r^4 (ai)S2i ...) and B <- rS (si;S2i ...) 
return (AB) 
. Diagonal : rA fc (^) (si , S2 := 
let A <— r^4( Sfc)S2fc) ...) 
return 5k A. 

• Sequence : See Algorithm 1. 

• MultiSet : See Algorithm 2. 

• Cycle : See Algorithm 3. 

Proposition 3.1. The previous samplers are valid Botzmann samplers for profiled 
objects. 

Algorithm 1: rSeg(^) (siiS2) ...) 

Input: the parameters Si,S2... 
Output: a sequence. 

1 Draw k following the geometric law Geom(FA.(si, s 2 , ...)). 

2 for i from I to k do 

a L- l <- [ A - *. 

i return (Ai..A k ). 
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Algorithm 2: TMSet(A)( Sua2t _) 



Input: the parameters s\,S2--- 
Output: a multiset. 

1 let K be a random variable in N* verifying 

¥(K<k) = J] exp(i^( Sj)S2j ,...)) 

j>k 

2 Draw k following the law of K. 

3 S^e 

4 if k = 1 then 



Draw q following a Poisson law of parameter F A (si, s 2 , ■■■ 
for i from 1 to q do 

Ai <— r^4( S1S2 ) 

5 <- Concat(S, Ai). 
return {S} 



10 else 



n 

12 
13 
14 
15 

16 
17 
18 
19 

20 



for j from 1 to k — 1 do 

Draw q following a Poisson law of parameter jF A (sj,S2j, ■ ■■)■ 
for i from 1 to q do 

S <— Concat(S, Ai). 

Draw q following a Poisson>i law of parameter £-FU(sfc, s 2 fc, ■■■)■ 
for i from 1 to g do 

4^rA t (i) (siA ...) 

S <- Concat(S, Ai). 
return {S 1 } 



Algorithm 3: rCyc(^) (sii 



Input: the parameters si,s 2 ... 
Output: a cycle. 

1 let if be a random variable in N* verifying 

^ K = k ) = -F^^ Hl-F Ak(A) ) 

2 Draw k following the law of K. 

3 let L be a random variable in N* verifying P(L = I) = Ak £ A) ln ( 1 _^ A — 

4 Draw I following the law of L. 

5 M <- e 

6 for i from 1 to / do 

A; <— rA( Sfc;S2fc; ...) 

M <- Concat(M, Ai). 



9 return [<5fcM]. 



Proof of the proposition 3.1. We only prove that for the sampler for MSet and Cyc. 
The other ones are easy exercices. Let us begin with the MSet sampler. First, 
suppose that during the execution of the algorithm, we have drawn k = 1. So, 
¥(K = 1) = cxp(F^(si, s 2 , ...)). After that, we have choosen q with probability : 

P(q) = cxp(-F A ( Sl , s 2 , ...)). ^ 
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and we have drawn q objects in A, each of them has profile a such that the number 
of parts of a of cardinal i is rn with probability : 

_ [s n ]F4( S(n)eN ,).s n 



^U(«(n)€N*) 

where we denote s^sVf ... by s n . So, the probability to obtain an object of profile 
a such that the number of parts of a of cardinal i is rii (when we draw k = 1) is 
the product of all these terms. This can be simplify in : 

ft [s md ]^(s ( „ )eN .) 

s \ ~> \ -> d=l 

00 1 2-^i 2-~i q\ 

EI -FA(Sj, S 2 j, •■•) 9=0 (mi,...,m q );X;m d =n 
j=l J d 

Now, for k > 1, in the same way, we can show that the probability is 

k q P 

n n is m ^}F A ( S{n)eK ,). s ™^ 

^-v t p=l d=l 

fe>l?i>0,...,g fc >0(m 1 ,i,...,m fc , 9fc ); J] -^( s i: s 2j, •••) 

fc 9, j = 1 3 

E E m .j= n 

i=ij=i 

So, using this sampler, the probability to obtain an object of profile a such that 
the number of parts of a of cardinal i is n, is 

fc <?p 

s n n n [s m - d ]^( S(Il)eN o 

^ ^— ^ 1910-1 I o t l ' 00 1 

k>l?i>0,...,9 fc >0(mi, 1 ,...,m fc ,, fc ); {[ - F A {Sj , S 2j , ...) 

E £ m «,j= n 

Indeed, this is all the ways to obtain an object of such a profile. 

Now, [s n ]F MSet( ^)(si,S2, •••) = E I\l sn '} ex P(-j F A(sj,s 2j , ...)). Moreover, 

J2 n j= n 3 

i 

[ S " j]e xp(i^ (Sj , S2j , ...)) = £! 2 ny-'i^f^. 

7 a=0"- « _ p=l 7 

n 3,P — n 3 

p=l 



So, 



a! 

j=l a— « p— 1 



E «3=n E 
j = i p=i 



We have to change the order of the sums and products. We use two times that 

00 00 

II J2 a a,j — II a aj,j 7 an d we obtain that the probability to draw an object 

j=laeA aei™ j=l 

. s n .[s n ]F MSetM) ( Sl ,s 2 ,...) 
of profile a is — ^ . 

FMSet(A) s 2, ■■■) 

Now, we analyse the sampler for Cyc. Suppose that the algorithm returns an 
object of size n and profile a such that the number of parts of a of cardinal i is rij. 
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The probability p n to draw such an object is 

l 

E E P(K = k).P(L = l).l[p mi 

k.l—n l i—1 

(mi,...,mi); J2 m d =n 

d=l 

with P(K = k) = L ln(l - F Ak(A) ), 

p( £ = ;) = ( f 4(>t)( S (n)a '))' I 



I ln(l - F AfcM) (s (n)eN .)) 

s m ']F Afc( ^( S( „ )eN ,).s m ' 



and Mi e {0...i},p m4 
So, 

?n= E E i? n ^lt[^ F ^Mn)e^)^ 

fe ^„ ^ F Cyc{A) (s {n)en .) k 111 



(mi,...,m;); £ m d =n 
d=i 



[ sn ]^Cye(.4)(g(n)gN*)-S n 

-Pcyc(^)( s (n)eN*) 

□ 



4. BOLTZMANN SAMPLERS FOR i-COLORED CLASSES AND COLORED CLASSES 

4.1. The t-colored Boltzmann Samplers. In this section, we mention how to 
obtain from the sampler for profiled objects the samplers for the i-colored objects. 

Proposition 4.1. Let us substitute Si by t.x 1 and generate a profiled object (a, a) 
as previously. For each part p of a, draw uniformaly a color c p in {1, ...,t} and let 
us assign the atoms in p with the color c p . 

This sampler is a valid Boltzmann sampler for the t-colored objects in the class 

A. 

Proof. Let us consider a ^-colored object a drawn by this sampler. Suppose that a 

is of size n and have ni atoms colored with the color i. The probability to obtain a 

is the sum on the a-admissible profiled objects times the probability to put the good 

II (t.xT' K a .[s n ]F A (t.x,t.x 2 ,...) 
colors, bo, we have: P(a) = ^ Tr~ ( 2 r • y=— = 

(Tadmissible profile k A (t.X, t.X , ...) t^ n ' 

x \a\ 

-7— r. K <y- [ sn ]^-4 {t- x > t.x 2 , ...) where n a is the number of =s -equivalent 

JA.\ X 1 (Tadmissible profile 

coloration that corresponds to the color of a. 

It can be proved that K a .[s n ]F A (t.x, t.x 2 , ...) = 1. 

(Tadmissible profile 

Another proof of this proposition is just to observe that after the substitution of 
Si by t.x 1 , our sampler is syntaxically identical to a classical Boltzmann sampler for 
/c-colored objects, except that we have postponed the Bernoulli choice of the color 
by the informations stored in the profile. □ 



So, we have valid Boltzmann samplers for the t-colored objects. We can use it 
for exact size generation as well as approximate size sampling. 
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4.2. Approximate-size Boltzmann Samplers for size-colored objects. Now, 
we want to use profiled object samplers to generate colored objects. Suppose that 
we want to generate a colored object of size n in a combinatorial class A. First, we 
have to solve the following equation to find the parameter xo which targets correctly 
-§-fj^{x,n) 

the samplers : x. ' r— = n. We draw a profiled object with the parameters 

JA{x,n) 

Si = u.Xq. Now, if we color this object with n colors, the sampler is uniform only 
on the class A$ that is to say when we have drawn a profiled object of size n. We 
unbiais the approximate size generator by adding some rejections. The following 
lemma explains how we filter the sampler in order to correct it. 

Lemma 4.2. Let A = {x\, Xk} be a set of objects and V a sampler for A that 
draws the object Xi with probability pi ^ 0. Let (p[ , . . . , p' k ) another distibution of 

pi 

probability on A, and suppose that the — forms a decreasing sequence. Consider 

Pi 

the following sampler V : 



Algorithm 4 : f" 



Input: the distribution of p\ 
Output: an object. 

1 Draw an object x with T 

2 Let i such that x — xi 

Draw X following a Bernoulli law of parameter 2lEL 

3 Pi-P\ 

4 if X = 1 then 

5 | return x 

6 else 

7 |_ restart V 

Then V draws the object Xi with probability p[. Moreover, V calls on average 

P' 

— times T to obtain an object. 
Pi 

Proof. First, observe that the parameter *' ) is in [0,11. Put Xi = *' ) , the 

Pi-Pi Pi-Pi 

probability to obtain Xi is (J2Pj%j) n Pi x i ~ "j — % = ' ■ I n this 

«=o j i-/^Pj x j l^Pj x j 

j 3 

Pi^ 

i i • i i , Pi-Pi , % Pi-P'\ P'i , rp, , , 

expression, substitute xi by -, we get ; = — — - = p.. the expected 

Pi.p\ PyPl 22 Pj 

Y j P3-p\ 3 

OO 

time is ^ (n + l)(J2Pj%j) n Pi x i- A similar calculation shows that this is equal to 

n— j 

Pi 

Now, consider the problem for sampling for colored objects of size around n, 
a sampler is described in Algorithm 5. In a first time, we propose to observe its 
running on an example : 



Example 4.3. A generation for a colored general tree of specification T = Z + 
ZMSet(T) by Algorithm 5, with parameters 14, x . 



12 



O. BODINI, A. JACQUOT 



First, a profiled tree A is drawn by Algorithm 2 with Sj = 14. Xq : 
We draw k=2. 

For j=l, we draw q=l, consequently we draw an only profiled object using rAiT( Slj 
Then, we draw q=2, this involves the sampling of 2 profiled object by rA2T( Sl;S2j .. 
After that, we get the following profiled tree A : 



□ 




As the size of A is 14 = n (So lucky, we are), we just have to return a 1^-coloration 
of A respecting its symetries. We find : 




Theorem 4.4. The following algorithm is a approximate- size Boltzmann sampler 
for the size-colored objects in A. Its overall cost is 



O ( n ((I - e) U{x ' n) + (1 + £ )( 1+e >" SaM \ 
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Algorithm 5: T, 



colored 



Input: the parameters ti,xq 
Output: a colored object. 

1 Draw a profiled object a with the parameters s, = n.x 

2 Let n <— size of a 

3 Let hi the number of parts of the profile of cardinal i. 

4 if h £ [(1 — e)n, (1 + e)n] then 



restart L 



colored 



6 else 



switch f/ie vahte h do 
case u = n 
|^ return a n-coloration of a. 

case n < n 

Let a be the minimum in 
{D n i ! in i = n an d [si 1 S2 2 • 



.]*U(*1, S2,-) 7^0} 



Draw X following a Bernoulli of parameter (— ) 

n 

if X = 1 then 
| return a n-coloration of a 
else 

|_ restart T colored 
case h > n 

Draw X following a Bernoulli of parameter ( — )~"+E> 

n 

it X = 1 then 
| return a n-coloration of a 
else 

|_ restart T co i ored 



Proof of the correctness ofT co i orec i- Using our sampler of profiled objects with the 
parameters Sj = n.x l Q , the probability to draw a profiled object of size n having a 

profile with hi parts of cardinal i is P„(h) = — — — . We have to add after 

JA{x,n) 

B^ 8i .I S .[8 a ]J^ 

that a filter in such a way that the probability becomes Ps(h) = ' ' . 

fA{x,n) 

So, we only have to know for which profile the ratio is maximum. If n < n 

(resp. n > n), this happens when ^ n, is minimum belongs the (nj, ri2, ...) such that 
Y^, in* = h and [s 7 ^ 1 s^ 2 ---jF^si, S2, ...) 7^ (resp. when m is maximum belongs 
the (ni,n2, ■••) such that ^* n « = an d [ s i 1s 2 2 ---]^a( s 1i s 2, ■■•) 7^ 0). Notice that 
the maximum is always h. The choice of the Bernoulli parameters follows directly 
from the lemma 

Under classical conditions explain in [5], the sampling in approximate-size of a 
fc-colored constructible structure can be done in linear time. Now, it suffices to 
evaluate the cost of the added rejection phase. The average number of loops is 

(1-e). , f A ^ n \ when ft < n and (l + e)( 1+ «>" otherwise. □ 

f A {x, (1 - e)n) f A (x, (1 + e)n) 
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Remark 4.5. At this stage, the explicit evaluation of the ratio —— — —-- — - is 

f A {x, (1 + e)n) 

out of reach, but it experimentaly seems that it is often better to choose n > n. In 
this case, it is more advisable to modify the line 4 in- the previous algorithm by 
(4') ifn£ [n, (1 + e)n] then restart T colored . 

5. Conclusion 

We have proved that the profiled objects are powerful tools to generate un- 
der Boltzmann model fc-colored objects and to obtain a generator in approxi- 
mate size for colored objects. Surprisingly, we only have to know the minimum 
in {J2 n i'J2^ n i — ™ an d [s" 1 s^ 2 ■■■]F A (si, s 2 , ■■■) ^ 0} to unbiais the generation of 
colored objects. We also expect to develop our ideas to even more general coloration 
types as for instance to impose constraints on the number of atoms with the same 
colors. Finally, we would like to thank E. Fusy, C. Pivoteau and M. Soria for their 
precious comments and corrections. 
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